Imitation Learning in The Game of Go with Joseki Options
نویسندگان
چکیده
Scaling reinforcement learning methods to large, challenging decision making tasks can potentially benefit from integrating domain specific knowledge in a principled manner. This synthesis focuses on applying two forms of domain knowledge about the game of Go to improve learning performance on what continues to be an extremely challenging task. First, learning is bootstrapped by using reinforcement learning to learn to imitate expert Go players. This utilizes databases of expert game records as a source of training experiences. Second, reusable options are automatically created from a joseki database, and reinforcement learning is used to learn when to apply these joseki during a game. Together these improve the performance of a reinforcement learning agent against a much better computer Go player.
منابع مشابه
Design and Implementation of a Heuristic beginning Game System for Computer Go
There are roughly three stages in a Go game: the beginning game, the middle game, and the end game. This paper describes a computer Go beginning game system which includes occupying corners, joseki, extending edges, and dealing with Moyo. This beginning game system has been used in a computer Go program named Jimmy 4.0. Having been tested by professional Go players, this system is estimated at ...
متن کاملActive Opening Book Application for Monte-Carlo Tree Search in 19×19 Go
The dominant approach for programs playing the Asian board game of Go is nowadays Monte-Carlo Tree Search (MCTS). However, MCTS does not perform well in the opening phase of the game, as the branching factor is high and consequences of moves can be far delayed. Human knowledge about Go openings is typically captured in joseki, local sequences of moves that are considered optimal for both player...
متن کاملApplying Data Mining to the Study of Joseki
Go is a strategic two player boardgame. Many studies have been done with regard to go in general, and to joseki, localized exchanges of stones that are considered fair for both players. We give an algorithm that finds and catalogues as many joseki as it can, as well as the global circumstances under which they are likely to be played, by analyzing a large number of professional go games. The me...
متن کاملCombination of real options and game-theoretic approach in investment analysis
Investments in technology create a large amount of capital investments by major companies. Assessing such investment projects is identified as critical to the efficient assignment of resources. Viewing investment projects as real options, this paper expands a method for assessing technology investment decisions in the linkage existence of uncertainty and competition. It combines the game-theore...
متن کاملAn Adaptive Learning Game for Autistic Children using Reinforcement Learning and Fuzzy Logic
This paper, presents an adapted serious game for rating social ability in children with autism spectrum disorder (ASD). The required measurements are obtained by challenges of the proposed serious game. The proposed serious game uses reinforcement learning concepts for being adaptive. It is based on fuzzy logic to evaluate the social ability level of the children with ASD. The game adapts itsel...
متن کامل